Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
                                            Some full text articles may not yet be available without a charge during the embargo (administrative interval).
                                        
                                        
                                        
                                            
                                                
                                             What is a DOI Number?
                                        
                                    
                                
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
- 
            Abstract Microbial processes are crucial in producing and oxidizing biological methane (CH4) in natural wetlands. Therefore, modeling methanogenesis and methanotrophy is advantageous for accurately projecting CH4cycling. Utilizing the CLM‐Microbe model, which explicitly represents the growth and death of methanogens and methanotrophs, we demonstrate that genome‐enabled model parameterization improves model performance in four natural wetlands. Compared to the default model parameterization against CH4flux, genomic‐enabled model parameterization added another contain on microbial biomass, notably enhancing the precision of simulated CH4flux. Specifically, the coefficient of determination (R2) increased from 0.45 to 0.74 for Sanjiang Plain, from 0.78 to 0.89 for Changbai Mountain, and from 0.35 to 0.54 for Sallie's Fen, respectively. A drop inR2was observed for the Dajiuhu nature wetland, primarily caused by scatter data points. Theil's coefficient (U) and model efficiency (ME) confirmed the model performance from default parameterization to genome‐enabled model parameterization. Compared with the model solely calibrated to surface CH4flux, additional constraints of functional gene data led to better CH4seasonality; meanwhile, genome‐enabled model parameterization established more robust associations between simulated CH4production rates and environmental factors. Sensitivity analysis underscored the pivotal role of microbial physiology in governing CH4flux. This genome‐enabled model parameterization offers a valuable promise to integrate fast‐cumulating genomic data with CH4models to better understand microbial roles in CH4in the era of climate change.more » « less
- 
            Fu, Yan (Ed.)Abstract Advances in mass spectrometry (MS) have enabled high-throughput analysis of proteomes in biological systems. The state-of-the-art MS data analysis relies on database search algorithms to quantify proteins by identifying peptide–spectrum matches (PSMs), which convert mass spectra to peptide sequences. Different database search algorithms use distinct search strategies and thus may identify unique PSMs. However, no existing approaches can aggregate all user-specified database search algorithms with a guaranteed increase in the number of identified peptides and a control on the false discovery rate (FDR). To fill in this gap, we proposed a statistical framework, Aggregation of Peptide Identification Results (APIR), that is universally compatible with all database search algorithms. Notably, under an FDR threshold, APIR is guaranteed to identify at least as many, if not more, peptides as individual database search algorithms do. Evaluation of APIR on a complex proteomics standard dataset showed that APIR outpowers individual database search algorithms and empirically controls the FDR. Real data studies showed that APIR can identify disease-related proteins and post-translational modifications missed by some individual database search algorithms. The APIR framework is easily extendable to aggregating discoveries made by multiple algorithms in other high-throughput biomedical data analysis, e.g., differential gene expression analysis on RNA sequencing data. The APIR R package is available at https://github.com/yiling0210/APIR.more » « less
- 
            ABSTRACT: Motivation Single-cell RNA sequencing (scRNA-seq) captures whole transcriptome information of individual cells. While scRNA-seq measures thousands of genes, researchers are often interested in only dozens to hundreds of genes for a closer study. Then, a question is how to select those informative genes from scRNA-seq data. Moreover, single-cell targeted gene profiling technologies are gaining popularity for their low costs, high sensitivity and extra (e.g. spatial) information; however, they typically can only measure up to a few hundred genes. Then another challenging question is how to select genes for targeted gene profiling based on existing scRNA-seq data. Results Here, we develop the single-cell Projective Non-negative Matrix Factorization (scPNMF) method to select informative genes from scRNA-seq data in an unsupervised way. Compared with existing gene selection methods, scPNMF has two advantages. First, its selected informative genes can better distinguish cell types. Second, it enables the alignment of new targeted gene profiling data with reference data in a low-dimensional space to facilitate the prediction of cell types in the new data. Technically, scPNMF modifies the PNMF algorithm for gene selection by changing the initialization and adding a basis selection step, which selects informative bases to distinguish cell types. We demonstrate that scPNMF outperforms the state-of-the-art gene selection methods on diverse scRNA-seq datasets. Moreover, we show that scPNMF can guide the design of targeted gene profiling experiments and the cell-type annotation on targeted gene profiling data. Availability and implementation The R package is open-access and available at https://github.com/JSB-UCLA/scPNMF. The data used in this work are available at Zenodo: https://doi.org/10.5281/zenodo.4797997. Supplementary information Supplementary data are available at Bioinformatics online.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
